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In a recent paper, Clark et al. (2020) analyze the relationship 
between socio-economic measures and estimates of average cog- 
nitive ability IQ’) in 140 countries. According to the data pre- 
sented by the authors, several African, South Asian and Central 
American countries have an average IQ below 50 (i.e. intellectu- 
ally disabled, according to DSM diagnostic criteria). Moreover, 
according to the data presented by the authors, the average cog- 
nitive ability of adults in African nations is ~1.6 standard de- 
viations below the cognitive ability of European adults. These 
notions are incompatible with psychological science and all con- 
clusions drawn from these data are invalid. 
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Introduction 


IQ scores! are a poor measure of cognitive ability, prone 


to cultural biases in testing paradigms and highly sensitive 
to systematic biases in population sampling (Gould, 1981; 
Richardson, 2002; Hampshire et al., 2012). Biased and in- 
correct estimates of average IQ in non-European populations 
historically was — and still is — a core component of scientific 
racism within psychology (Belkhir, 1994). In a recent paper, 
Clark et al. (2020) investigate the relationship between IQ 
scores as an estimates of average cognitive ability and other 
socio-economic statistics in 140 countries (Fig. la). The au- 
thors present estimates of ‘national IQ’. The definition of “na- 
tional IQ’ is not given in Clark et al. (2020), but can be found 
in Lynn and Meisenberg (2010): ‘national IQ’ is defined such 
that the ‘national IQ’ of the United Kingdom is 100, and the 
standard deviation of IQ scores within the United Kingdom 
is 15. In other words, national IQ’ is claimed to be stan- 
dardized such that these scores allow direct comparison of 
cognitive ability between populations. Thus, if a country has 
a ‘national IQ’ of 115, it means that — on average — an indi- 
vidual from that country would receive an IQ score of 115 in 
the United Kingdom. 

Within a comparable population, as a rough indicator, ex- 
tremely low IQ scores can be a sign of clinically significant 
cognitive impairments, such as intellectual disability (Boat 
and Wu, 2015). In DSM-IV (APA, 1998), intellectual dis- 
ability was partly diagnosed by an IQ score of below 70 (i.e. 


‘An IQ score is a measure of performance on a cognitive test, defined in 
relation to the performance of other individuals on the same test. Mathemat- 
ically, the IQ score is a re-scaled z-score, scaled such that an individual with 
a performance equal to the population mean receives an IQ score of 100. IQ 
scores are usually normalized such that the standard deviation is 15 (individ- 
uals performing one standard deviation above average are assigned a score 
of 115). 
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Fig. 1. Estimates of cognitive ability in the Clark et al. (2020) data. a, Countries 
included in the Clark et al. (2020) analysis of ‘national IQ’. b-c, Distribution of na- 
tional |Q estimates across continents in the ‘NIQ QNWSAS’ and ‘NIQ_R’ datasets. 
Dot/triangles indicate the average national IQ of one country, dotted vertical line 
indicates the DSM-IV criterion for intellectual disability. 


Severity Cases DSM-IV DSM-5 

Mild 1.96% IQ: 50-69 Can live independently with minimum 
levels of support. 

Moderate 0.23% IQ: 36-49 Independent living may be achieved 
with moderate levels of support, such 
as those available in group homes. 

Severe 0.08% IQ: 20-35 Requires daily assistance with self- 
care activities and safety supervision. 

Profound 0.03% IQ< 20 Requires 24-hour care. 


Table 1. Diagnostic criteria for intellectual disability in DSM-IV and DSM-5 
‘Cases’ indicates approximate incidence of mild, moderate, severe, and profound 
intellectual disability in the population. Table adapted from Boat and Wu (2015). 


two standard deviations below the mean of 100 in the popu- 
lation). In recognition of the methodological issues with IQ 
scores aS a measure of cognitive ability in general, in DSM-5 
(APA, 2013), the use of IQ scores were replaced by descrip- 
tions of the level of support needed by an individual to func- 
tion (Table 1). Individuals with intellectual disabilities are 
rare in the population, with ~ 1.96% displaying mild symp- 
toms (IQ: 50-69) and ~ 0.35% displaying moderate, severe 
or profound symptoms (IQ < 50, unable to live independently 
without moderate levels of support). 


Issues with the Clark et al. (2020) data 


Clark et al. (2020) present three datasets of ‘na- 
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tional IQ’ (NIQ’): ‘NIQ_LV12GeoIQ’, ‘NIQ_R’ and 
‘NIQ_QNWSAS’. The dataset ‘NIQ_ LV12GeoIQ’ is de- 
scribed as being based on data sourced from Lynn and 
Vanhanen (2012). The dataset ‘NIQ_R’ is sourced from 
Becker and Rindermann (2016), which bases the esti- 
mates on Lynn and Vanhanen (2012). The estimates 
in ‘NIQ_LV12GeoIQ’ and ‘NIQ_R’ are almost identical. 
The dataset “NIQ_QNWSAS’ is from an online repository 
(Becker, 2019) associated with Lynn and Becker (2019), de- 
scribed by Clark et al. (2020) as being based on Lynn and 
Vanhanen (2012) combined with “data from school assess- 
ment studies—mainly Progress in International Reading Lit- 
erary Study (PIRLS), Programme for International Student 
Assessment (PISA), and Trends in International Mathemat- 
ics and Science Study (TIMSS)”. 

We extracted the estimates of national IQ from the Clark 
et al. (2020) online supplementary data and discovered se- 
rious flaws. In both the ‘NIQ QNWSAS’ dataset (Fig. 1b) 
and the ‘NIQ_LV12GeoIQ’/‘NIQ_R’ datasets (Fig. Ic), sev- 
eral countries were assigned an average national IQ scores 
well below 70, the DSM-IV diagnostic criterion for intellec- 
tual disability. For example, in the ‘NIQ_QNWSAS’ dataset, 
several African, South Asian and Central American countries 
have an average IQ below 50, such as Nepal (national IQ: 
43.0), Sierra Leone (national IQ: 45.1), Guatemala (national 
IQ: 47.7) and Gambia (national IQ: 49.8). These estimates 
would seem to suggest that a majority of the population in 
these countries are moderately, severely or profoundly cog- 
nitively impaired (cf. Table 1). This notion is incompatible 
with psychological science and there is no doubt that these 
estimates are wrong. 

Comparing the data across continents, we observed a 
strong systematic bias in the estimates of cognitive ability. 
According to the estimates analyzed by Clark et al. (2020), 
the national [Q of African countries (Fig. 2a) is 24.4 points 
lower than the national IQ of European countries (mean 
IQ: 69.6 v. 94.9 for the ‘NIQ QNWSAS’ dataset, 72.3 v. 
96.2 for the ‘NIQ_R’ dataset, both p < 0.001, N = 29/41, 
Mann-Whitney U-test, Fig. 1b-c). This is a huge mean dif- 
ference (~1.6 standard deviations), similar in magnitude to 
the the DSM-IV diagnostic criterion for intellectual disability 
(2 standard deviation). There is no doubt that these estimates 
of national IQ for African nations are incorrect. 

In addition to the impossibly low estimates of na- 
tional IQ, we also note that there are large, unexplained 
disagreements between the ‘NIQ QNWSAS’ dataset and 
the ‘NIQ_LV12GeoIQ’/‘NIQ_R’ datasets, for example for 
Nicaragua (AIQ = 28.7), Djibouti (AIQ = 19.4), Haiti (AIQ 
= 20.7), Sierra Leone (AIQ = 17.2), Guatemala (AIQ = 32.2) 
and Honduras (AIQ = 18.0) (Fig. 2a). These large discrep- 
ancies are a reflection of the fact that the methods used to 
estimate national IQ are flawed and unreliable. 


Why are these IQ estimates so wrong? 


All estimates presented by Clark et al. (2020) are based on 
estimates in Lynn and Vanhanen (2012), which is a revised 
and extended version of estimates first presented in Lynn 
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and Vanhanen (2002). Several authors have cataloged grave 
methodological issues with previous iterations of the national 
IQ datasets by Lynn and colleagues, including the use of non- 
representative samples, biased inclusion and exclusion of 
data, and miscalculations (Richardson, 2004; Wicherts et al., 
2010c,a,d; Loehlin, 2007; Dickins et al., 2007). 

Evidently, the data quality issues persist in the most recent 
iteration of the national IQ estimates (the “NIQ QNWSAS’ 
dataset, Becker, 2019). A full review of the methodological 
issues is beyond the scope of this note, but to give a sense of 
the issues with data quality, we list a few examples. The esti- 
mate for Somalia (national [Q: 67.7) is based on a single sam- 
ple of Somali refugees aged 8-18 tested in the Daadab refugee 
camp in Kenya (Bakhiet et al., 2017). The estimate for Haiti 
(national IQ: 83.9) is based on a sample of 133 rural 6-year- 
olds (de Ronceray and Petit-Frere, 1975) and a sample of 54 
individuals with a mean age of 10 years (Cotten, 1985). The 
estimate for Botswana (national IQ: 69.5) is based on a single 
sample of 104 ethnically Tswana high-school students aged 
17-20 (natively Tswana-speaking), tested in English in South 
Africa (Maqsud, 1997). The estimate for Burkina Faso (na- 
tional IQ: 73.8) is based on a single sample of rural children 
aged 5-15 (Bagby, 2011). The estimate for Nepal (national 
IQ: 43.0) is based on samples of rural vitamin-A-deficient 
children (Buckley et al., 2013), rural mothers in experimen- 
tal nutrient supplement trials (Christian et al., 2010) and ru- 
ral rice paddy and wheat farmers surveyed in the agricultural 
year 1977-1978 (Jamison and Moock, 1984). The estimate 
for Nabimia (national IQ: 66.2) is based on a single sample 
of 103 Herero-speaking children aged 7-12 (Veii, 2003). The 
estimate for Saint Vincent and the Grenadines (national IQ: 
63.4) is based on a single sample of 174 rural children aged 
8-11 (Durbrow et al., 2002). The estimate for Malawi (na- 
tional IQ: 69.7) is based on a sample of children aged 7-14 
(7.5% never having attended school, 14.6% school dropouts) 
from villages in a rural district where 76.1% of the popula- 
tion lives below the World Bank’s international poverty line 
(Van de Vijver and Brouwers, 2009). 

From these examples, it is clear that the problem of ’na- 
tional IQ’ as a metric of cognitive ability is not just that IQ 
tests are inherently culturally biased, or that the mathematical 
strategy used to standardize raw IQ scores from one country 
to acommon ‘national IQ’-standard is unreliable. Rather, the 
underlying raw data 1s so extremely unrepresentative that it 1s 
impossible to correct these data for biased sampling. 


Discussion 


The notion that the average IQ of whole countries is below 50 
(intellectually disabled, unable to live independently without 
moderate levels of support, according to DSM-IV diagnos- 
tic criteria) is evidently wrong. It is also impossible that the 
average IQ of African nations is ~1.6 standard deviations be- 
low the average IQ of European nations. As pointed out by 
Wicherts et al. (2010b), presenting such low IQ estimates is 
equivalent to claiming that African adults have the cognitive 
ability of an average 11-year-old European child. This notion 
is incompatible with psychological science. 
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Fig. 2. Average IQ of African and North American countries in the Clark et al. (2020) data. Dots/squares/triangles indicate IQ estimates in the ‘NIQ_QNWSAS’, 
‘NIQ_LV12GeolQ’ and‘NIQ_R’ datasets. Dotted vertical line indicates the DSM-IV criterion for intellectual disability. 


About the quality of the IQ estimates, Clark et al. (2020) 
write that 


To our knowledge, these are the most complete 
and well-validated country-level IQ data avail- 
able (Lynn & Meisenberg, 2010), but the quality 
of the data varies by country. [...] Note also that 
noise in the data, if anything, should obscure our 
hypothesized pattern of results. 


This is incorrect. As outlined in the previous section, for 
many countries outside Europe, the Clark et al. (2020) IQ 
estimates are based on samples of poorly educated individu- 
als (including children and refugees), often living below the 
poverty line. Extrapolating national IQ’ estimates from bi- 
ased and unrepresentative samples results in biased and in- 
accurate data. The errors in the Clark et al. (2020) data do 
not constitute random noise, which might simply mask true 
statistical interactions. These data are so biased that any con- 
clusions drawn from them are invalid. 

There is no scientific justification for the discussion by 
Clark et al. (2020) of “societies with relatively high average 
intelligence’, “societies with relatively low average intelli- 
gence.”, “populations with relatively lower mean cognitive 
ability” or “more cognitively advantaged populations”’. 

In the case of Clark et al. (2020), pre-publication peer re- 
view failed to detect that serious flaws in their underlying 
data render their conclusions invalid. As a practical sug- 
gestion to help avoid such failures in the future, we suggest 
that the editors of Psychological Science mandate that au- 
thors not just provide summary statistics in tables, but also 
include plots showing the underlying raw data points. The 
gross flaws in the data reported by Clark et al. (2020) Gm- 
possibly low IQ estimates) would have been immediately ob- 
vious to any referee presented with such a plot. Plotting raw 
data points is standard practice in many experimental fields 
and major journals (e.g. Nature Neuroscience) do not accept 
manuscripts with only summary statistics. 


Conclusion 


The estimates of cognitive ability presented by Clark et al. 
(2020) are so flawed that it is impossible to draw any conclu- 
sions from their analysis. The paper should be retracted. 
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METHODS 

Estimates of the incidence of intellectual disability shown in Table 1 are calculated 
from the relative incidence of mild (85%), moderate (10%), severe (3.5%) and pro- 
found (1.5%) intellectual disability (Boat and Wu, 2015, chap. 9), combined with 
an overall incidence of intellectual disability in the population of 0.9-3.7% 2.38% 
(Boat and Wu, 2015, chap. 15). A similar estimates can be reached by assuming 
that IQ scores are roughly normally distributed. For a normally distributed variable, 
2.3% of the probability mass is more than two standard deviations below the mean. 
Datasets from Clark et al. (2020) were downloaded from the online supporting ma- 
terial available at https://osf.io/ecdrt/. All code for plotting and analysis are available 
as a Jupyter Notebook (.ipybt notebook and .html render) as supplementary mate- 
rial. 
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